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Abstract. The control landscape for various canonical quantum control 
problems is considered. For the class of pure-state transfer problems, analysis of 
the fidelity as a functional over the unitary group reveals no suboptimal attractive 
critical points (traps). For the actual optimization problem over controls in 
L-^(0, T), however, there arc critical points for which the fidelity can assume any 
value in (0, 1), critical points for which the second order analysis is inconclusive, 
and traps. For the class of unitary operator optimization problems analysis of 
the fidelity over the unitary group shows that while there are no traps over 
U( AT) , traps already emerge when the domain is restricted to the special unitary 
group. The traps on the group can be eliminated by modifying the performance 
index, corresponding to optimization over the projective unitary group. However, 
again, the set of critical points for the actual optimization problem for controls in 
L-^(0, T) is larger and includes traps, some of which remain traps even when the 
target time is allowed to vary. 
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1. Introduction 

Although quantum theory has been in existence for about a century, until recently, the 
main emphasis in the field was on constructing Hamiltonian models and solving the 
Schrodinger equation. Although it was recognized that external fields and potentials 
could change the Hamiltonian and thus the dynamics of a system, and such external 
fields were certainly used in many areas from nuclear magnetic resonance to atomic 
physics to effect changes to the system, it was only rather recently that the full 
potential of using such external fields was recognized. Since then the subject of control 
of quantum systems has developed from a niche area into a subject of rapidly growing 
interest and importance, with an ever increasing number of applications ranging from 
quantum chemistry to quantum information processing (see e.g. jl]). Quantum control 
has been applied, for instance, to influence the outcome of chemical reactions [2], 
to prepare entangled states [3], which are a resource in quantum metrology and 
information processing, and to realize quantum gates [1], which are fundamental 
building blocks for a quantum computer. The scope of the applications is vast. 

As the potential and importance of quantum control was realized, attempts 
were made to formulate quantum control problems abstractly, and establish solid 
mathematical foundations. Some pioneering attempts were necessarily imperfect and 
the theory is still under development, but there have been a number of interesting 
theoretical results. One important area is the control landscape b'.. While it 
is not too difficult in many cases to formulate a particular task in terms of a 
control optimization problem, i.e., of finding an admissible control that maximizes a 
performance index, solving the resulting control optimization problems is challenging, 
especially for large or complicated systems, and in almost all cases they can only be 
solved computationally using numerical optimization strategies. Since the evaluation 
of the performance index for quantum control problems generally requires the solution 
of an operator or partial differential equation, evaluation of the latter becomes 
computationally demanding and efficient algorithms are therefore crucial. Such 
algorithms exist for finding local extrema, especially when gradients can be computed 
relatively cheaply compared to the computational overhead involved in evaluating 
the performance index, as can be achieved in our case. We are however primarily 
interested in global optima of that function, and this is where the "landscape" defined 
by the objective function becomes relevant. 

If the landscape is littered with many suboptimal, locally attractive critical points 
then optimization strategies designed to find local extrema have a very low probability 
of finding a global optimum from a generic starting point. In such a situation global 
search strategies are usually required, involving substantially greater computational 
overheads. If the landscape is nice, however, i.e., if most local extrema are actually 
global extrema, then algorithms that find local extrema are likely to succeed in finding 
a global optimum and the computational overhead of global search algorithms can be 
avoided. The landscape for quantum control problems was initially expected to be 
complicated, but it was quickly realized that optimization algorithms designed to find 
local extrema were often very successful at finding controls for which the performance 
index to be optimized took values very close to its global optimum. This has led 
a number of researchers to study the issue and put forward certain arguments why 
this should be the case. The core argument was that, contrary to expectations, the 
landscape for quantum control was actually extremely simple and devoid of any traps, 
i.e., suboptimal locally attractive critical points. 
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The main objective of this paper is to revisit this topic, taking a closer look at 
the control landscape. Previous studies [ilHIHllllinillllllllllllllin] have almost 
all yielded results that only apply to regular critical points, in particular showing that 
these can not be traps for typical quantum control problems. However, it is well-known 
in the broader field of optimal control that these are not the only critical points [TB] , 
and that the role of such non-regular critical points requires analysis [171 UHl US] . For 
quantum control problems the existence of non-regular critical points has only recently 
been acknowledged [20] but their properties have not been studied. In particular, it 
is not known if such critical points can be traps. The existence of such traps would 
complicate the control landscape and invalidate some general results arrived at by 
considering only regular points. The main objective of this paper is to show that 
non-regular critical points do exist for a large class of quantum control systems and 
that at least some of these are suboptimal extrema for which the performance index 
can take many values. The particular examples we construct to rigorously prove this 
claim are quite artificial, but they prove that such bad points do exist. For problems 
of practical interest it is generally extremely difficult to rigorously prove that a critical 
point is a trap as these problems can only be solved numerically. However, we present 
numerical evidence that suggests trapping behavior does occur for practical quantum 
optimal control problems even if such behavior appears to be relatively rare, at least 
for the simple systems that have been studied extensively to date. This suggests that 
further analysis is required to fully characterize the control landscape, including the 
nature of potential traps and their practical relevance, e.g., for the design of efficient 
algorithms. 

The paper is organized as follows. In Sec. II, we formally define the problem 
and give the mathematical prerequisites. In Section HI, we take a closer look at the 
landscape for pure-state transfer problems. In Sec. IV we consider the landscape for 
unitary operator optimization problems. Finally, in Sec. V, we present the results of 
a numerical investigation of the control landscape for a three-qubit gate optimization 
problem, before concluding with a brief discussion in Sec. VI. 

2. Background and Mathematical Prerequisites 

The focus of this paper is quantum systems subject to Hamiltonian evolution 
governed by the Schrodinger equation. Although the latter generally applies to states 
evolving over an infinite-dimensional Hilbert space, we restrict our attention to finite- 
dimensional systems. This is the usual context for work on control of quantum systems 
and motivated by the fact that in practice one is usually either dealing with a system 
that intrinsically has a finite number of degrees of freedom such as a collection of spins, 
or for which the Hilbert space can be faithfully truncated, and the controlled dynamics 
is restricted to finite-dimensional subspace of interest. Assuming this Hilbert space 
has dimension N, the evolution of the system can be described by a unitary operator 
Uf{t) £ U{N) satisfying the Schrodinger equation 

ihtlfit) = Hf{t)Ufit), Uf{0)=L, (1) 

where / is the identity matrix in U(iV) and Hf{t) is the Hamiltonian of the system, 
which in our case will be control-dependent, as will be indicated by the subscript /. 
For simplicity the control dependence will be assumed to be linear in the (real- valued) 
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controls fm{t), 

M 

Hfit)^Ho+J2fn.{t)H„,, (2) 

m— 1 

where Hq is the system's intrinsic Hamiltonian and Hm are control interaction 
Hamiltonians. In the following we set M = 1 unless otherwise stated, choose units 
such that h= 1, and define the two-point propagator (7/(^2, ii) = U f {t2)U f {tij^ . 

2.1. Control problems 

Most typical control problems for (Hamiltonian) quantum systems fall in one of the 
following three categories: pure-state transfer, observable control and unitary operator 
implementation. The pure-state transfer problem consists of preparing the system in 
a desired state \^ g), usually at a given time T, assuming it is initialized to some state 
l^o) at time 0. A natural measure of success in achieving this goal is the transfer 
probability 

Jp(/) = |(*,|*(T))p, (3) 

where in our case |*(r)) = C//(T)|^'o) and Uf{T) is a solution of 0. For a given 
Hamiltonian, initial and target states, and fixed target time T, the transfer probability 
depends only on the choice of control /, i.e., if the controls vary over L^(0,r) 
then : L^(0,r) i— )■ M. The observable control problem lies in taking a system 
characterized by an initial density operator po at time to a state p{T) at time T 
for which the selected observable A attains its maximal possible value, i.e., we aim to 
maximize the expectation value 

5z5(/) = Tr[Ap(r)], (4) 

where p{T) — Uf{T)poUf{TY and Uf{T) is again a solution of ([l]). The gate synthesis 
control problem consists in choreographing the system dynamics to implement the 
unitary gate V over the time interval [0,T]. Using the Hilbert-Schmidt (Frobenius) 
norm to measure distance from the target gate V, this leads to the gate fidelity 

dv{f)^^mcTT[V^UfiT)] (5) 

as a natural performance index, where [// (T) is a solution of ([I]) as before. 

Broadly speaking, the objective of quantum control is to find a control / to 
maximize one of these performance indices, where / is allowed to vary over a large 
function space, while T is fixed unless stated otherwise. In practical implementations 
of optimization algorithms for these problems, / will always be restricted to a finite- 
dimensional subspace, but this subspace may be arbitrarily large, and we will be 
chiefly interested in the ideal case where / varies over L^{0, T) in this paper, except in 
Sec. 5. So in general, we wish to find the (global) maximum of a (non-linear) functional 
5^ : L^(0,T) — 7> M, which in our case is either ^Jp, g^u, or g^y. It will often be fruitful 
to think of 5^ as a composition of the solution functional Uf{T) : L^{0,T) U(iV) 
taking a control field / to the corresponding propagator at time T, U{T), via ([ij, and 
the fidelity on the Lie group & : U(A^) — > M, and we shall use 

'Sp{U) = \{^g\U\^o)\\ eDiU)^Tr[AUpQU% ©v(C/) = ^ Tr[WC/], (6) 
respectively, to denote the equivalents of ^p, and above. 
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2.2. Controllability considerations 

Whether we aUow the fields /,„ to range over the whole of L^{0, T) or a smaller space 
such as the continuous or piecewise constant functions, we must consider whether 
the aforementioned control problems are feasible, i.e., whether the maximum of the 
performance index is actually attainable, or the question of controllability. More 
specifically, we are interested in whether for a given system Hf{t), and all endpoint 
conditions - either {l^fo) , {po,A} or {L,V}, there exists a time T and an 

(admissible) control f that solves the corresponding control problem. In general, for 
bilinear control systems on a Lie group, as in our case, this problem reduces to one 
about the Lie algebra £ generated by the matrices {iHm}m=o- I^^r pure-state control 
problems controllability is equivalent to £ being a representation of either sp{N/2), 
sp{N/2) ® u(l), su{N) or u(A^). For mixed-state (density-operator) control problems, 
we require su{N) or u(A^), while the system is fully unitary operator controllable only 
when £ = u{N) [5T]. Note that the distinction between su(iV) and u{N) is artificial 
in the quantum case as the extra dimension contributes only a global phase, which 
is generally not physically observable. Since it is convenient, and indeed common, to 
fix the final time T when carrying out control optimization, we are also interested in 
the stronger notion of exact-time controllability, namely, whether a system has some 
critical time Tc such that all endpoint conditions give solvable control problems for 
any T > Tc. It turns out that, neglecting global phase, these notions of controllability 
are equivalent to the previous three by Theorem 13 of [22] . 

Apart from these now standard results, we will need the following. 

Theorem 1. Let H ~ Hq + f{t)Hi be a control system, £ = u(A^) or su{N) and 
G ~ U(A^) or SU(A^) be associated Lie group. 

• // the set of all commutator expressions in iH^ and iHi joined with iH^ and 
iLIi span £ (Lie algebra rank condition) , then there exists a maximal time T^ax 
and neighborhood Af of (Hq, Hi) in i£ x i£ such that for all systems s € Af and 
elements g G, there is a control taking s to g in time T < Tmax- 

• // the set of all commutator expressions in iHo and iHi joined with iHi span 
£ (exact-time Lie algebra rank condition^, then there is a critical time Tc and 
neighborhood Af of {Hq, Hi) S i£ x z£ such that for all s € Af, g € G and T > Tc, 
there is a control taking s to g in time T. 

Proof. Assuming that our system specified by {Hq,Hi) satisfies the Lie algebra rank 
condition, by Theorem 1 of [22 there is a set of values wi, . . . , w„ such that the function 



has Jacobian dF of full rank at some point t. This implies that the extended function 



also has Jacobian dF' = I o / o j of full rank at the point {Hq, Hi,t). Thus by the 



inverse function theorem, choosing neighborhoods JV of (Hq, Hi) and F of t bounded, 
there is a neighborhood W of F{t) such that F'{{A} x {B} x V") includes {A} x {B} x W 
for every (A, B) e Af. By compactness (and connectedness) , some power W'^ covers 




F' : {A,B,ti,...,t„) ^ (A,B,e-*('^+""-^)*"---e-^('^+"i^)*i) 




f Note again that the exact-time and usual Lie algebra conditions match for su{N). 
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the entire group G. Letting T be the supremum of J2n^n ^^'^^ ^^^^ 
systems in M the entire group is reachable in less than time kT. 

Assuming {Hq,Hi) satisfies the exact-time Lie algebra rank condition, the same 
argument may be repeated with 

F : (to, . . . , t„) ^ g-i(Ho+^.„Hi)t„ . . . ^~t{Ho+voHr)to 

as constructed in the proof of Theorem 3 in [22] , where now F has domain restricted to 
J2n = T. This alteration shows that the critical time of exact-time controllability 
can indeed be chosen uniformly about {Ho,Hi), as claimed. □ 

2.3. Gradient and Hessian formulas 

Finding the global optimum of a function is generally a very difficult task. On the 
other hand, there are many efficient algorithms to find local extrema, i.e., attractive 
critical points of a function. For this reason we are particularly interested in the nature 
of the critical points of the various functionals 5^, which depends only on certain local 
properties of the solution functional Uf{t). To study the latter, we need the identity 

Uf+Af{t) - Uf{t) = -I f Uf{t,T)[Hf+AfiT) - HfiT)]Uf+Af{r) dr (7) 

Jto 

which can be verified by differentiating both sides, and used to derive 
Uf+Af{T) = Uf{T)-i r Uf{T,r)AH{T)Uf{T)dT 

-[ r Uf{T,T)AH{T)Uf{T,a)AH{a)Uf{a)dadT + 0{\\Aff) (8) 

jQ JQ 

where AH{t) = Hf+Afir) - Hfir) is just Af{T)Hi in the M = 1 case. For 
a, P G L^(0,T) we define the linear map 

£ dr, ^-fj^ = -^UJiT,r)H,UJir), (9) 

corresponding to the gradient of Uf{T) at /, and the bilinear map 
(a,/3) K> - y" J Uf{T,T)AH{T)Uf{T,a)AH{a)Uf{a)[a{T)/3{a) + l3{T)a{a)]da dr 

0<cr<T<T 

(10) 

corresponding to its Hessian. Indeed, the second order perturbative expansion ([s]) is 
the functional analog of the Taylor expansion of a function, with the gradient and 
Hessian functionals corresponding to the first and second order terms, respectively. 
Since the Hessian is naturally a linear operator mapping L^(0,T) to itself, it is 
convenient to have notation for referring directly to some such operators, along with 
the elements of themselves. We shall use H to denote the unnormalized projection 
onto its argument, and • as a placeholder for the argument of the anonymous function 
it is a constituent of. For example, H[cos(w»)] refers to the projector onto the function 
t I— >■ cos(wt), that is, the operator that maps each vector a G L^{Q,T) to the vector 

/J'cos(a;T)a(T)drcos(w») in L^(0,T). 



Quantum Control Landscapes: A Closer Look 



7 



The importance of the gradient and the Hessian functionals is that the former 
determines the critical points of the functional ^, and the latter their nature. In 
particular, if the Hessian at a critical point / is negative definite, then the critical 
point is attractive and corresponds to a local maximum over any finite-dimensional 
subspace of L^(0,T); similarly, if it is positive definite, the critical point is repulsive 
and corresponds to a local minimum over any finite-dimensional subspace, and if the 
Hessian at the critical point can take both positive and negative values, the critical 
point is a saddle. 

3. Pure-state landscape 

3.1. All Critical Points over SU(A'^) or \J{N) global extrema 

As mentioned in the introduction the problem of finding a control / S L^(0,T) 
to maximize the pure-state fidelity functional ([s]) can be viewed a composition of 
finding a U^, E U(A^) or SU(iV) that maximizes ^p, and an optimization problem of 
finding a control in L'^{0,T) that realizes Uf{T) = Defining A = |\I>g)(vI/g| and 
po = |5'o)(4'o|, we have |(*g|;7|*o)P = Tr [A [/po i-e., we can rewrite the pure- 
state optimization problem as a general observable optimization problem. It is easy 
to show that a necessary and sufficient condition for U € U(A'') or SU(A^) to be a 
critical point of Tt[AUpoW] is that A and UpoU^ commute, i.e., [A,UpoW] — 0. In 
our case, as A and po (and thus UpoU^) are projectors onto pure states, there are only 
two types of critical points: (i) A and U poU^ are projectors onto orthogonal subspaces 
of H, in which case we have Ti-[AUpoW] = 0, and (ii) A and U p^U^ are projectors 
onto the same ID subspace of "H, in which case we have Tt:[AU p{)W] = 1. Thus, the 
landscape for pure-state optimization over \J{N) or SU(A'^) is very simple: there are 
only two types of critical points, corresponding to extremal values of the fidelity, i.e., 
global extrema, and no saddles or other critical points for which the fidelity assumes 
values in (0, 1). The critical points over SXJ{N) or U(iV) are sometimes referred to as 
kinematic critical points |20j . 

Since then several papers have attempted to show that this result extends to the 
actual optimization problem over L^(0, T), i.e., that the fidelity as a functional over 
L^{0,T) only has critical points for which it achieves either its global minimum or 
maximum 1. However, the arguments put forward in [6] and [7] rely, without rigourous 
justification, on the property that the solution functional / i— >■ Uf{T) is of full rank 
everywhere. |13| gives another proof but relies on the same problematic assumption. 
[5] argues that a certain sequence of expressions starting with equation (14) and (15) 
therein generate the full Lie algebra generated by iHo and and hence, assuming 
controllability, span the entire Lie algebra su{N). But this procedure only allows 
one to generate specific linear combinations of commutators [§] and, although it may 
seem plausible, these are not guaranteed to span su{N). For the more general density 
matrix control problem, [3] similarly assumes that the map / H> Uf{T) is of full 
rank everywhere to derive that the final state Uf{T)poUf{Ty must commute with the 
observable A at critical points, and hence that and 1 are the only critical values for 
pure states. [TT] makes use of the same problematic assumption, referring to [S] as 
evidence of its validity, to re-derive this critical point characterization, along with the 
result that all but the global maxima and minima are saddle points. p3] recognises 

§ There appears to be a mistake in equations (12-15) but our assertion still applies when the corrected 
expressions are used. 
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the possibility of singular controls /, here of / i— > not being full rank for 

a particular control /, along with the issues this raises, yet defers analysis of such 
points to future work. [5^ finally gives a characterization of singular controls and 
acknowledges that these can be critical points of the performance index that do not 
correspond to kinematical critical points, but considers only one example involving 
eigenstates of a four-level system, to conclude based on numerical simulations that 
singular controls do not appear to be traps for this problem. 

Therefore, the question of the existence of non-kinematic critical points for the 
pure-state fidelity ([3|, i.e., critical points / G L^(0,T) for which does not assume 
extremal values, remains open. We now show that such critical points do exist for all 
control-linear Hamiltonian systems, and while most of them cannot be expected to 
be attractive, one can construct examples where calculation of the Hessian at such a 
non-kinematic critical point shows it to be strictly negative definite. Thus, not only 
do non-kinematic critical points exist but they can be attractive, i.e., traps. Hessian 
calculations also show that the Hessian need not have finite rank at such critical 
points. The main implication of these examples is that landscape analysis based on 
considering regular points only is incomplete, and that in some cases at least, traps 
definitively do exist. They may be generally rare but even a single trapping point can 
be problematic if it has a large domain of attraction. It should be noted here that 
the mere existence of singular controls implies that the end-point map / i— > '^f{T) 
cannot be full rank everywhere, and thus that the hypothesis on which most of the 
landscape results cited above are predicated, cannot be satisfied in general, so that 
further analysis is necessary. But this still leaves open the question whether the results 
are true in the absence of this hypothesis, and the significance of exhibiting examples 
for which traps provably exist is to show that the landscape is not universally trap-free. 

3.2. Any critical value possible for critical points over L^{0,T) 

Let us start by considering, for any choice of final time T, a pure state control problem 
for a two-level system of the general form: 

(*,| = -^=|l.e-le'™", |*„) = -^=[e-",e-"'+»>F, (lib) 

where we can exclude the degenerate case = j'J'o) by choosing 6 ^ 

(mod it). For the field / = which is identically zero over [0,r], we can calculate 
{^g\Uo{T)^o) = cos{0), so that the fidelity S'p is cos^(6i). Moreover, / = is a critical 
point since the general gradient formula 

'^^^ 23m ({^g\Uf{T)Uf{tyH,Ufit)\^o)WJWW^) 



Sfit) 

in this case evaluates to 

3m (ce*" + de*('^-'')*-'(''+'^) + c.c.) cos(6i) = 



for all t 



A value of 6 can be chosen to achieve any fidelity in the interval (0, 1), and this can 
be done avoiding the degenerate case by choosing the right sign for 9 when necessary. 
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So we already see there is a large family of control problems admitting critical points 
for which the fidelity assumes any possible value but this is simply a special case of 
the more general fact that follows. 

Theorem 2. Civen any bilinear control system with control Hamiltonian Ho + f{t)Hi, 
any target time T > 0, and any F € (0, 1), there exist pairs of initial and target states 
l^'o), l^'g) and a control f such that f is a critical point of the fidelity with critical 
value = F. 

Proof. Any bilinear control system Hq + f{t)Hi with f{t) ~ f{t) + /i is equivalent to 
one with Hamiltonians Hq = Hq + Hi and control f(t). 

Case 1: Suppose there is a value of /i G M such that Hq has degenerate eigenvalues. 
Choose a two-dimensional subspace S of this eigenspace. Then, restricted to the 
subspace S the Hamiltonians take the form 

(-) 

It easy to see that / = is a critical point with fidelity cos^{9 ~ ip) for any T for the 
state transfer problem i-> \^ g) on S with 

= [cos(0),sin(0)e-'T], |*o> = [cos((^), sin(^)e'^]^ 

if the Hamiltonians restricted to S take the form ( |12[ ) . Since S is an invariant space 
of U{t) = e^''^°, if the initial and final states are in 5, the formulas for the fidelity 
and gradient simplify to their analogues restricted to S. Therefore, / = is a critical 
point for the state transfer problem j^E'o) ^ for the original system if we embed 
the initial and target state into the full Hilbert space T-L in the obvious manner. 

Case 2: If the eigenvalues of Ho+fiHi are always distinct then we can continuously 
parametrize the (unique) eigenvectors w™'^'^(/x) and finin(M) corresponding to the 
maximal and minimal eigenvalues of H{ii) = i^o + fJ-Hi), respectively. Letting 
^max^ Aniin bc the maximal and minimal eigenvalues of Hi, we have the identities 

hm {v^-''{^i)\{~Hl)\v^-''{^l)) - -A^i„ hm {v'^-''{fi)\Hi\v^-^{fi)} = A'"-'^ 

lim (Wmin(Ai)l(--H^l)kmin(M)) = -A"''"'' lim (Wmin(/^)|i?l|Wmin(Ai)> = Amin 

f — oo foo 

Hence by continuity we must have {v'^'^^{fi)\Hi\v'^'^^{fi)) = (wmm(M)l^^ibniin(M)) for 
some (finite) unless A™'^'^ = Amin, which would mean that Hi is a multiple of the 
identity and so any value of fi would do. Using this /i leads to an effective system 
Hq, Hi which, when restricted to the subspace S spanned by {v"^^^ {^) , VmaxitJ-)} , and 



expressed in this basi s, is precisely of the form (11 1. Again we can extend the initial 
and target states in ( |ll| to the full Hilbert space in a trivial way so that / = is 
a critical point with fidelity cos^ for the resulting state transfer problem with the 
Hamiltonian Hq + fHi. Restriction to S is again a well-behaved procedure since 
the span of each eigenvector is invariant under the time evolution generated by the 
corresponding Hamiltonian Hq. □ 

This theorem shows that for any control system, in particular any controllable 
one, there exist pairs of initial and target states for which we can achieve critical 
values of the fidelity in (0,1). However, the initial and target states which exhibit 
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such critical points in the proof above will not in general be eigenstates of Hq. One 
might wonder whether critical points for which the fidelity does not achieve extremal 
values might be impossible if the initial and target states are restricted to eigenstates 
of the system Hamiltonian Hq, as is the case in many applications, but even in this 
case we can find counterexamples. 

Example 1. Non-extremal critical points for eigenstate transfer problem. 

Consider the system Hf{t) = Hq + f{t)Hi with 

/I o\ /I 7273 \ 

ffo - 2 , ^1 = 7273 2 

yo 4/ V 71/3 4 / 

and the initial and target states I'i'o) = [1,0, 0]""", I'^g) = [0,0,1]'^. for this 
system has a critical point at f = —1 as 6^p/Sf = |3m[3cos(i) — 5] = with 
= |sin^(T/2), where T is the target time. Indeed, we can even verify that this 
system is controllable as Hq has distinct transition frequencies and Hi is connected. 



3.3. Attractive suhoptimal critical points over L^{0,T) 



We can furthermore show that the critical points above, for which the fidelity does 
not assume extremal values, are not necessarily saddle points either. We show this for 
pure state examples but the problems can be reformulated as observable optimization 
problems with A — for a density matrix that is a projector onto the pure 

state l^'(i)). Hence the examples also show the existence of non- kinematic critical 
points for which the observable fidelity ^jj = Tt[Ap{T)] assumes non-extremal values 
and which are not saddle points. 

Example 2. Critical saddle points with negative semi-definite Hessian. Let 

Hf{t) = Hq + f{t)Hi with 



Hn = 



Given any target time T , choose the initial and target states 



(2 














1 










4 








, Hi = 


1 





1 











5 








1 





1 


Vo 








9^ 




\0 





1 


0/ 



|*o> = [cos 0, 0,0, sin ( 



[e'-"' cos6',0,0,e'y' sin 6*] 



Then f = is a critical point of the fidelity, since Uo{t) = e 
g\UoiT,t)HiUo{t)\^o) = 0, with critical value ^p = cos^i 



-itHo 



is diagonal so that 
>). The Hessian of 



the fidelity at this point can be computed as 

-2cos{e~(j)) J J [cos(6') cos((/)) cos(2(cr - r)) 

0<a<T<T 

+ sin(6l) sin((/.) cos(4(cr - r))] [a(T)/3(cr) + /3(r)a(f7)] da dr 

which just constitutes two instances of the C operator from \Appendix A\ and so can 
more clearly he written as 



-2 cosf 



cos6'cos(?!)(n[cos(2»)] +n[sin(2»)]) 
|-sin6'sin0(n[cos(4»)] + n[sin(4»)]) 
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This expression is clearly negative semi-definite whenever cos cos (j) and sin 9 sin (j) are 
both positive, equivalently, when 9 and (f> lie in the interior of the same quadrant. In 
this circumstance we are exhibiting a point f which to second order is not a saddle. 
For example, for 9 = ^ and (j> = ^ we obtain — | and Hessian taking the form 
-| [n[cos(2.)] + n[sin(2«)] + n[cos(4.)] + n[sin(4.)]] . 

By extending t^, we can compute the third order derivative of in this case to be 



12cos{9-ip) J J J [cos(6') sin(v5) sin(4p + 2cr + r) 

0<p<cr<T<T 

- sin(6l) cos((p) sin(p + 2cr + 4:T)]Af{p)Af{a)Af{T) dpda dr 



in direction A/. When T ^ ir, the first and second order derivatives of^p in direction 

( 1 if t < TT 

^(t) := < „ ,, . vanish, with the third order one beinq ^ sin (2(9 — (h)) . The 
' [0 otherwise n 7 \ \ -r// 

condition already imposed on 9 and 4> implies that \9 — < ^, hence, unless 9 — <f> ~ Q 

thus ^p = 1, one of ±7 is a direction of increase, showing that f is nonetheless a 

saddle point. 

This example shows that we cannot rule out the existence of traps by just looking 
at the gradient and Hessian. In fact, we can construct examples for which we have 
attractive critical points with fidelity in (0, 1), i.e., traps. 

Example 3. Non-extremal critical points with negative definite Hessian 
(traps). Consider the system Hf{t) = Hq + f{t)Hi with 



fl + e 0\ 

10 

2 

\ 2/ 



H, = 



r 


1 





'\ 


1 





1 








1 





b 


Vo 





b 


0/ 



Taking the target time to be T — ^, choose the initial and target states 

|*o> = ^[e'',0,0,e~^r, I*.) = -^[e-^(i+^),0,0,e-2^]T. 

For f = it is easy to verify that the gradient vanishes identically as before, so this 
is a critical point, with fidelity "Sp = cos^ 9. The Hessian is 



cn[l] — y j cos6'[cos6'cos(e(T — ct)) — sin0sin(£(T — a-))][a(cr)/3(T) 

0<CT<T<f 

with c = —6^ cos^ 9, which, using Appendix A can be rewritten as 



■ f3{a)a{T)]dadT 



ao 



n[l] + ^ afe (n[sin(2fce«)] + n[cos(2fce«)]) - cos^ 6'(n[sin(e«)] + n[cos(e«)]) 



fc=i 



with the coefficients given by oq = ^ sin(20) — 6^ cos^ 9 and Ok = ^^^^^iT) ^'^^ k > 0. 
As — cos^ 9 is always negative, the Hessian will be strictly negative definite if < 
for all k, which is equivalent to b^ cos^ 9 > sin(20) > 0. With & = 3, for instance, 
this double inequality is satisfied for all 9 G (0, and this range of 9 yields fidelities 
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'Sp = cos^ G [ji !)• Therefore, for all of these values of 9, the critical point / = 
corresponds to a local maximum, at least over any finite dimensional subspace, with 
< 1. Note that this example also shows that the Hessian at a (non-kinematical) 
critical point need not have finite rank. 

By computing the rank of the relevant Lie algebra, we can verify that Hq,Hi 
specifies an (exact-time) controllable system at e = 0, so that our uniform 
controllability result applies. Time-T controllability can therefore be ensured by 
choosing a sufficiently small e, which both makes Hq,Hi lie in the neighborhood 
of uniform controllability and T = be larger than the minimal time required for 
controllability. 



3.4. Non-constant controls 

Theorem [2] and the previous examples rehed on constant controls to show that 
critical points with non-extremal values of the fidelity exist for all systems with 
Hf{t) = Hq + f{t)Hi, i.e., any choice of Hq and Hi, and that such critical points 
need not be saddle points but can in fact be traps. It is also interesting to consider 
how the critical point condition 6^p /Sf = could be satisfied for a given system for 
non-constant controls f{t). Let = {•^g\Uf{T)\<i/o){^g\Uf{T), then the critical 

point condition reads 

3m{'<SJB\Ufit)^HiUf{t)\-^o) = 0, (13) 

where Uf{t) must satisfy the Schrodinger equation ([ij, i.e., we have a Differential- 
Algebraic equation system. Note that since Uf{T) is of course not known in advance, 
we set = (^'6|*o)(*6| for some {^^1 and then \^g) = Uf{T)\'^i,), a posteriori. 

While the existence and uniqueness of solutions to such systems is in general not trivial 
to ascertain, differentiating the constraint and using the Schrodinger equation leads 
to a more explicit form 

= 3m(1's|-ffi|*o) (14a) 
0= D\t{^B\[Ho,Hi]\^o) (14b) 
0= 3m{^B\Uf{t)^[Ho + fit)Hi,[Ho,Hi]]Ufit)\^o)- (14c) 

Assuming the first two equations hold, which can be thought of as a two (real) 
dimensional constraint on (^'b|, the system can be solved, at least locally about t = 0, 
by adjoining the constraint 

^ 3m(vl/B|t//(t)t[go, [Ho,Hi]]Uf{t)\^o) 

3m{^B\Ufit)nHi,[Ho,Hi]]Uf{t)\^o) ^ ' 

to the Schrodinger equation, under the generically true condition that the denominator 
does not vanish at < = 0. The additional, also generic, property that df / dt\t^Q ^ 
guarantees that the critical point / in question is not a constant function. As the 
fidelity S'p for the control / is K^'hj'I'o)!^ we can construct critical points for which 



the fidelity does not take extremal values. Eq. (151 can be solved numerically to find 
non-constant non-kinematic critical points for which the fidelity assumes any desired 
value, as has been done for a four-level system in [20]. While some of these critical 
points may be non-attractive, as appeared to be the case for the example studied in 
|20| there is no particular reason why all such critical points should be non-attractive 
in general. Unfortunately, it is difficult to prove this, however, due to the difficulty 
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involved in calculating the Hessian and proving it to be strictly negative definite given 
only a numerical solution of (15). 



4. Unitary Operator Landscape 

4.I. No suboptimal attractive points (traps) over \J{N) 

The set of critical points of the fidelity as a map from the unitary group U(iV) to M 
given by 0y(C/) = j^mzTr[V''U] is equal to aU U for which y{eTr[V''UA] vanishes 
for every anti-Hermitian matrix A G u{N). Since the bilinear map ©y is an inner 
product over complex matrices for which the Herniitian and anti-Hermitian matrices 
constitute orthogonal subspaces, the latter condition is equivalent to W = V^U being 
Hermitian. As W is also unitary it must be of the form W ^ Ps — Pg, where Pg 
is a projector onto a subspace S of C^, and Pg the projector onto the orthogonal 
complement of S. From this we see immediately that Tr(iy) is equal to N — 2d, where 
d is the dimension of the subspace S, and thus there are + 1 critical manifolds 
corresponding to critical values 1 — ^ of (Sv{U) for d e {0, . . . , N}. Moreover, the 
second derivative of is IHe [y^C/A^] , where A^ can be any negative semi-definite 
matrix. Thus U = ±V, corresponding to W = ±/ are global extrema, and all other 
possible W have both positive and negative eigenvalues, so that 5le [M^A^] can take 
any value, showing that all other critical points U are saddle points. 

This characterization of the critical points on U(iV), which can be traced back to 
at least [23j , provides the motivation for the claim in [8] that the landscape for unitary 
operator optimization has critical points only at these values of the fidelity, and that 
all critical points U, except U — ±V, which correspond to the global maximum 
and minimum of (&v{U), respectively, are saddle points. As in the pure-state transfer 
case, to conclude this for the optimization problem over L^(0, T) from the observations 
about the critical points of (Sy as a functional on U(A^), it is imphcitly assumed that 
the solution functional U f (T) is regular, in the differential geometric sense of having 
a Jacobian of full rank, everywhere over L^(0,T). [T^] includes the condition that 
variations of the controls can be used to generate any local variation of U{T) as part 
of the definition of controllability. As before, the full-rank-everywhere assumption is 
problematic as it can not be satisfied in general. Indeed, for any constant control 
/ = /i, the trajectory Ufj,{t) will be a linear combination of expressions e"''*'^*, where 
Xj are the eigenvalues of Hq + lJ.Hi- The gradient of the solution operator at this 
point —iUfj,{T,T)HiUf^{T) is therefore a linear combination of functions e*(^j^^'=)'^ for 
j, /c e {1, . . . , N}, or equivalently of the real functions 1, cos((Aj — Afc)T), sin((Aj— Afe)r) 
for 1 < j < fc < A^, of which there are only ~ N + I < dimu(A) = A^^. More 
generally the rank of the Jacobian where all controls are constants fj,m is at most 
A^^ — A^ -I- Af because, in the eigenbasis of iJo + S f^rnHm, the diagonal elements of 
each U^{t)'^ HmU^{T) are constant functions. The observation that constant controls 
are singular in the sense of not being full-rank has also been made recently in |20| 
but without fully considering the implications for the validity of the landscape results. 
Also, the application of results in [19] as suggested in [20] is problematic, as the former 
work, originating in sub-Ricmannian geometry requires a (strictly) positive definite 
running costs on the controls, which serves as a regularizing term and is not present in 
our case, and highly undesirable as it would prevent us from ever reaching the global 
maximum of the actual objective function. Also properties generic over the infinite 
dimensional space of vector fields, such as the main results of [19] or [17], need not 
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hold for any instance within the finite dimensional subset of right-invariant vector 

fields which wc restrict attention to. Thus, it is still not clear whether the fidelity 
over L^(0,T) has critical points that do not satisfy the criteria above, and whether 
these can be attractive. Before we study the fidelity over L^(0,T), it is useful to 
briefiy consider the critical points of ^v{U) on the Lie group in more detail, as there 
are some subtleties one should be aware of; in particular when the system evolution 
is restricted to a subgroup of U(A^) such as SU{N), attractive critical points may 
emerge even on the group and it may desirable to eliminate these by modifying the 
performance index. 

4-2. Attractive suboptimal critical points over SU{N) 

Many problems in quantum control involve control Hamiltonians Hi that have zero 
trace, i.e. iHi e su{N), in which case exact-time U(iV) controllability cannot hold. 
Indeed, we always have det[C//(t)] = e"*'^/-'^ "^[^"l = e"^, regardless of the control f{t), 
so the reachable set at time T is restricted to matrices in SU(A^) times the fixed phase 
e*"^. Hence, even assuming that the system is U{N) controllable and V,T are chosen 
such that dct{V) = e^'^ to make V reachable, when considering the critical points of the 
fidelity on the group, we should really consider the critical points of 25\/ = fHe Tr[W^] 
as a functional over W = V^U £ SU(iV). The critical point condition is now that 
fHeTr[VFyl] = for all traceless A e su{N), and thus that W be equal to a Hermitian 
matrix R plus a multiple ia of the identity with a real. Both this and the unitary 
conditions are properties of the spectrum alone, so we have in general that, W is a, 
critical point of &v whenever its eigenvalues are ie'^ and ie~'^ with multiplicities d 
and N — d, for some d E {0, . . . , [A''/2J}. Finally, the unit determinant condition on W 
forces e'* to equal exp(z g^^j^^^^ ) times any ~ 2d|"^ root of unity. Note that when 
N > 2, this set of points is larger than the set of critical points over U(iV) which also 
happen to lie in SU(7V). The second derivative of (&viW) is again j^lReTr [WA^], 
but now with A G su(N), and only the Hermitian part R oi W contributes in this 
expression, since A^ is negative semi-definite. It is clear that if R has eigenvalues of 
both signs, implying > 3, then the corresponding critical point W is a second order 
saddle. Otherwise = 0, and the critical point is a local maximum or minimum if R is 
a positive or negative multiple of the identity, while in case i? = 0, we can find curves 
aie"^^ for which &v does not vanish to third order in x, so is a saddle point. This 
characterizes the attractive critical points of &v{U) as those U of the form e^^'^V for 
some TV*'' root of unity e**^ having positive real part, and continuity of the solution 
operator Uf{T) then implies: 

Theorem 3. For any controllable system, H = Ho + f{t)Hi with Tj:{Hi) = 0, and 
any F = cos{2TTk/N) with k = 1, . . . , [iV/4] — 1, there exists an open subset J\f of 
L^(0,T) such that any monotonically increasing local optimization algorithm started 
with f gW will never exceed a fidelity of F. 

4-3. Elimination of traps by optimization overP\J(N) 

These results show that that for optimization of SU(iV), there are traps, i.e., attractive 
critical points of &v{U) with (8v{U) < 1, and this is relevant for optimization 
problems over i^(0,T) with Tr[_ffi] = 0. As the attractive critical points differ only 
by a global phase factor e*^ from the target gate V, however, and we usually do 
not care about the global phase of a gate, these traps are not really a problem in 
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practice, provided the convergence condition of the algorithm is more sophisticated 

than checking if 1 — Sv{Uf{T)) is less than a certain tolerance. However, if we do not 
care about the global phase of the gate V, a better choice of the performance index 
to be optimized would be 

e'^:U^^\Tr{V^U)\'', (16) 

which is equivalent to optimization over the projective unitary group PU(A''). To 

see why it is a better choice, note that the critical point condition for &y is 
9le [jTr{WA)] = for every A e 5u{N) with W = V^U as before, and 7 = Tr[VF]. 
This is satisfied only when = ail + R for some real a and Hermitian R with 
all eigenvalues of the same magnitude. The second derivative at these points is 
2 *He Tr[i?j4-^] + 2|Tr[M^A]p, the first term can only fail to ever be positive for any 
A e su{N) if is a non-negative multiple of the identity, ov N = 2. In the first 
case, either 7 = or = e'^J, which corresponds to &y attaining its minimal or 
maximal values respectively. Otherwise, R has eigenvalues of opposite sign, so the 
second derivative's first term must vanish for every A S su(A^), and if its second term 
always vanishes then U must have equal eigenvalues, thus maximizes &y. 

4-4- Non-global maxima for optimization over L^{0,T) 

The previous section shows that suboptimal attractive critical points do arise for 
optimization of SU(A'^), and that this case is relevant for quantum control problems, 
but these can easily be eliminated by changing the performance index. We now turn 
our attention to the existence of sub-optimal attractive critical points for the actual 
optimization problem of interest, i.e., the problem of maximizing the fidelity over 
L^{0,T). In particular we are interested in whether there are suboptimal attractive 
critical points, i.e., points Uf{T) for which the gradient of'Sv vanishes, the Hessian is 
negative definite, and S^y(f7/(T)) < 1 for systems that are time-T controllable. 

Example 4. Traps for unitary operation optimzation problem. Consider 

the control problem specified by: 

/l+£ 0\ /o 1 0\ /e*"^ \ 

Ho= [ 1 , iJi = 1 b 1 , = ie'T e''^"" 
\ 2/ \0 1 c/ \ ie±*T/ 

with T = ^, whose gradient at f = can be calculated as 

l3mTT{V''Uf{T)Hi) = iasin((^) + |(6 + c) 003(7) 
and whose Hessian at the same / = expands to 

1 1^4 
- -wU[l] + 3a:E ^(4fc2_i) (n[sm(2fc£»)] + H[cos(2fe.)]) 

+ ^y(n[cos(.)] + n[sin(.)]) - ^^(n[cos(£.)] + H[sin(e.)]) 
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where the constants w,x,y,z are defined in [11] below. This formula for the Hessian 



again comes from applying the results of \Appendix A\ to the original operator expression 
\ jj [a;sin(£((T - t)) - (a^ cosv? - (6^±c^) sin7) 

0<CT<T<f 

+ ycos((T — t) — zcos(e(a- — r))] [q;(ct)/3(t) + a{T)l3{a)] dadr 

Therefore, a sufficient set of conditions for this operator to be negative definite, thus 
the fidelity to have a local maximum over all finite dimensional subspaces at f = 0, 
are the constraints: 

= a sin (/5 + (5 + c) cos 7, (l^a) 

< w ~ |:(cos7 — sin ip) + cos ip — (6^ ± c^) sin 7, (17b) 

> X = COS7 — sin(y3, (17c) 

> y = sin7 ± sin7, (17d) 

< z ~ cos (fi — sin 7. (17e) 

We are interested in whether these conditions, which are independent of e, can be 
satisfied for some choice of the other parameters, while having Hq,Hi satisfy the 
(exact-time) controllability Lie algebra rank condition for e = 0, so that by our uniform 
controllability result a sufficiently small choice of e would make this system exact-time 
controllable at T . Indeed, several such choices of parameters exist, for example: in the 
—i case, a = 5-\/2/3, = 4, c = 1, = 7 = fidelity ^ « 42%, or in the 

+i case, a = ^flji, 6 = -1, c = 0, (yS = f , 7 = -f with a fidelity of ^ + ^ ^ 82%. 
4-5. Variable time non-global maxima 

In typical quantum control problems, the target time T is fixed, but T can be allowed 
to vary. If the target time T of control problems is not fixed in advance, and is 
consequently allowed to vary in the corresponding optimization formulation, there is 
some ambiguity as to how proximity between control fields is to be measured when 
these lie in different spaces L^(0, T). We choose to resolve this difficulty by optimizing 
over Hamiltonians of the form H{t) = £Hq + f{t)Hi with T fixed, since propagating 
H{t) up to time T is equivalent to propagating Hq + jf{t/t)Hi up to time IT. The 
expressions for the gradient ^ and Hessian ( 10 1 derived earlier from the perturbative 



expansion can be adapted to this variable time optimization framework simply by 
replacing all instances of AfHi = Hf+Af ^ Hf by AIHq + AfHi. In particular, at 
a point where / = 0, we find the solution functional has gradient simplifying to 

{ia®a)^~i[ Uf{T,T)HiAf{T)Uf{T)dT -iU{T)THoAe (18) 
Jo 

while its Hessian is the bilinear map on M © iy^(0, T) given by 

(4©a,4©/3) - y" J Uf{T,T)HiUf{T,a)HiUf{a) [a(T)^(a) + /3(T)a(a)] da dr 

0<cr<r<T 

- U{T) f {[Uf{T)^HiUf{T),Ho]T + HoU{T)^HiUf{T)T) [a(r)4 + /3(t)£J dr 

-Uf{T)T^H^ejb. (19) 
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Example 5. In this framework we consider the unitary control problem specified by: 



Ho 



/I 


+ 


e 
















1 











1 










Hi = 


1 


6 1 















2 










1 c 


9 


V 





















5 


dj 






/ 


r 












-7)\ 

















—i 





















































re' 









with T = J, where a fortiori s = \/l — r^, and we shall set b = — -s/S), c = -s/S — 3, 

d = 3, g= ^J^^i]F+12^/^, r = 1(3 - v^) « 0.28, = |, 6* = -f mi/i 7 arbitrary. 
With these parameters, the gradient at f = 0, £ = 1 always vanishes, and the Hessian 

evaluates to 

-CT'^ - 0.80 n[l] - 1.60 (n[cos] + n[sin]) - 0.28 (n[cos(£»)] + n[siii(£»)]) 

00 . 

- E ^(4fc2 _ 1) (n[sin(2fe.)] + n[cos(2te.)]), 

which is always positive definite, since C , although dependent on e, must be at least 
3.5, and the fidelity here is j^(33 — 2-\/3) 62%. 

5. Practical Evidence for Trapping Behavior 

Having demonstrated the existence of suboptimal attractive critical points for 
quantum control optimization problems, the question arises how significant such points 
are in practical applications. The examples in the previous sections for which we 
can rigorously prove the existence of such bad points are clearly quite artificial, and 
mainly serve to prove a theoretical point. Unfortunately, it docs not appear that 
any theoretical machinery is available to provide a general answer to this question for 
far more complex problems that are of immediately practical interest, so we will use 
numerical simulations to give a glimpse at the range of possibilities. For this purpose, 
we consider the problem of implementing a quantum Fourier transform gate for a 
three-qubit Ising-coupled spin chain with full local control. Explicitly, the system has 
the time varying Hamiltonian 

2H{t) = ZiZ2+Z2Zs + fi{t)Xi+f2{t)Yi+fs{t)X2+f4{t)Y2 + f5{t)Xs + fe{t)Ys. (20) 

with six control fields /i,...,/6, where {X,Y, Z,I} are the unnormalized Pauli 

matrices and Xi is a shorthand for the tensor product X (g) 7 7, etc. The quantum 
Fourier transform objective V is specified by 



{j\V\k) 



(21) 



where uj is the canonical choice e '^tt/s^ ^j^g prefactor \/i = e^'^^^'^+i)/^ for some 
integer m is needed to ensure V e SU(2^) as our Hamiltonians are in 5u(2^). We 
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Figure 1. Histogram of errors at the termination for 100 selected successful 
runs (co-inciding with the 100 good runs plotted in Fig. ^ show that the good 
runs terminate with errors < 5 X 10~^, with many reaching smaller errors. The 
histogram of the errors for the 42 bad runs (inset) show that all but one of these 
terminate with errors > lO"'^, orders of magnitude larger than the terminal errors 
for the good runs. 



choose TO = 5. To solve the optimization problem numerically we use the standard 
parameterization of the controls fr in terms of piecewise constant functions over S 
intervals forming a uniform partition of [0,T], i.e., effectively we define fr(t) = Ors 
whenever (s — 1)^ < t < s^, and optimize over all Urs, s = 1, . . . , S variables. 

This system satisfies the condition of exact-time controllability over SU(8), the 
case we are usually interested in, although, as is generally the case, we do not know 
the critical time required to render the whole group accessible. Neither do we 
know how to ensure having a suitable number of time slices S, except that we must 
satisfy the dimensionality constraint 65* > 8^, otherwise the optimization will either 
get trapped or converge to unbounded fields, for almost all objectives V. However, the 
fact that we were always able to achieve errors 1 — g'y below 10^^° for 50 randomly 
selected target gates suggests that our choice of T = 8 is sufficiently large, and of 
S = 140 appropriate. Ideally, we would like control optimization to be effective even 
under the weaker condition that the target gate V be accessible at the given time T, 
using S time slices. This is certainly true, at the 10~^° accuracy level, for our V. 

For the optimization we primarily used a quasi-Newton method |24j . relying on 
the BFGS formula to construct an approximate Hessian, and an exact analytic formula 
for the gradients. This choice was motivated by related work in which this procedure 
performed well relative to various others, in particular at high fidelities. Each run 
of the algorithm was initialized by setting the a^s variables independently at random 
from a standard normal distribution, a choice which was deemed suitable considering 
the typical norm of high fidelity solutions. We carried out 1,000 optimization runs and 
found 958 of them successfully passed our pre-selected error threshold of 10~^, while 
of the remaining 42 all but one failed to reach even the 10~^ level. A histogram of the 
terminal errors for 100 selected good runs versus the 42 bad runs (Fig. [ij shows that 
the final fidelites achieved by the good runs are many orders of magnitude higher than 
the terminal fidelities for the bad runs. Fig. [2] further shows that the convergence 
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iteration number (n) iteration number n 

Figure 2. Convergence behavior for suceessful runs and trapped runs. Error 
1 — Sv{n) as a function of the iteration number n for 100 successful runs (left) 
shows rapid convergence to final fidelities in the range of IQ-iO and 10" m 
less than 500 iterations, while the error for the bad runs (right) remains virtually 
constant over thousands of iterations at values around 10~^, more than 8 orders 
of magnitude larger than the final error for the successful runs. 

behavior for the successful runs is distinctly different from that of the failed ones. 
Considering that the successful runs reached fidelities > 99% on average in a few 
dozen iterations taking mere seconds, the bad runs constitute effective traps in that 
the otherwise highly efficient quasi-Newton optimization routine gets stuck at these 
points and the runs are extremely time-consuming. 

We further explored the behavior of the fidelity in neighborhoods of these bad 
points by evaluating it for 10, 000 small random field perturbations around each of the 
presumed traps f* . The results show that the fidelities for all of these fields are smaller 
than the fidelity (f* ) at the presumed trap f* , suggesting that the f* are indeed in 
the trapping neighbourhood of a local maximum for all 42 runs that terminated at 
fidelities ^ 0.99. An example of a scatter-plot of the fidelities of the perturbed 
fields f as a function ||f — f*|| in a neighborhood of trap 1 is shown in Fig. |3] We 
know that this is not a result of the target time T being too short or the dimension of 
the subspace of L^ being too small as there are solutions for the fields for T = 8 and 
S = 140 with errors 1 — 5^y(f) on the order of lO"^**, close to machine precision. To 
ascertain whether these bad fields f* indeed attracted trajectories, we also restarted 
the optimization with perturbed fields f = f * + Si. Even when the size of the 
perturbations was increased to Sfm on the order of 0.1 the runs tended to remain 
stuck in the trapping region as illustrated in Fig. [4] Finally, we also tested other 
implementations, including several variants of sequential update algorithms, similar 
to the Krotov method without penalty, but starting with the bad points f* none of the 
runs managed to escape the trapping regions. It is difficult to ascertain if there is a 
single trap or possibly multiple traps in each region, and there are clearly limitations 
of this computational investigation, but taken together, the simulation results strongly 
suggest that there are suboptimal extrema and effective trapping regions, at least for 
the 6 X 140-dimensional search space we chose to optimize over. 

It is difficult to ascertain whether these bad points would constitute traps if each of 
our six control fields were not restricted to the 140-dimensional subspace of i^(0, 8) 
consisting of piecewise constant functions. To get a glimpse of how the convergence 
behavior changes when we enlarge the subspace of L^ we project onto, we subdivided 
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Figure 3. Fidelity sampled in a neighborhood of a typical trap. Sv{^) ~ Sv{^*) 
is negative, bounded away from 0, for all sample points, showing that the fidelity 
at all of these random samples is less than the fidelity at the presumed local 
extremum f* , suggesting that f, is indeed a local extremum. 
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Figure 4. Convergence near five randomly selected "traps" f*. For each f«, for 
clarity, the difference of the fidelity as a function of the iteration number, 5\/(n), 
and the fidelity of the nearest "trap" = is shown for 10 runs starting 

with f« +(5f. For each run the difference in the fidelites vanishes, i.e., the fidelities 
seem to converge to the respective fidelity value for the closest trap. Inset shows 
close-up of a small portion to better resolve the trajectories. 

the time grid three-fold to 5* = 420, and restarted the optimization, starting with the 
field f* for which the optimization got trapped on the original subspace. As one would 
expect, tripling the dimension of the parameter space we optimize over from 6 x 140 
to 6 X 420 = 2520 does increase the attainable fidelities slightly, but in all the cases we 
studied the improvements were small, and the optimization over the larger subspace 
exhibited trapping behavior similar to that on the original control space, terminating 
with substantially lower fidelities after many (time-consuming) iterations than typical 
successful runs (see Fig. Isl) . That refining the mesh would lead to higher fidelities is 
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Figure 5. Cumulative plot of the number of runs versus the final error. (1) after 
runs reached the 10~* error threshold or exceeded a thousand BFGS iterations 
(blue, largest), (2) after increase in fidelity dropped below 10~^ in additional 
Newton iteration steps (red, intermediate), (3) after increase in fidelity obtained 
by a Newton iteration remains below 5 X 10~^ even after sub-division of time 
intervals (yellow, smallest). The inset focuses on the failed runs. Since the rank 
order of the final errors between runs did not change much across the three stages, 
one can see that the gains made by going beyond a thousand iterations or refining 
the mesh are marginal. In fact, the largest reduction in the error obtained by 
refining the mesh was only 7%, orders of magnitude less than what would have 
been achieved by a successful run with less computational effort. 

rather inevitable, since even if there is a local maximum in infinite dimensions, any 
finite dimensional subspace will almost surely not contain this point, but the distance 
between increasingly larger subspaces and the point must decrease, tending to zero, 
allowing the fidelity to increase towards an asymptotic value. This phenomenon can 
conveniently be visualised by considering a non-degenerate quadratic potential on the 
plane and a generic line through the origin: the maximum on the line is lower than the 
maximum over the plane. The fact that fidelity increases so little (relative to the error) 
after we refine the mesh is, on the other hand, an indication that said asymptotic value 
is likely to strictly below the global maximum of 1, and the data certainly suggests 
that the neighborhoods of the f* are really bad neighborhoods for the optimization. 

6. Conclusion 

We have revisited the control landscape for several classes of canonical quantum control 
problems, in particular pure-state transfer and unitary gate optimization problems. 

Although the pure-state transfer fidelity as a function over the unitary group 
only has two types of critical points, corresponding to either the global minimum 
or the global maximum 1, detailed analysis shows that the class of critical points 
for the actual optimization problem over functions in i^(0,T) is strictly larger than 
the set of kinematic critical points. In particular, for any bilinear control system 
Hf{t) = Hq + f{t)Hi and any fixed target time T, there exist pairs of initial and 
target states, such that some / = const, is a critical point of the system, and we can 
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achieve any value of the fidehty between and 1 for such critical points. Moreover, 
while these critical points are not expected to be attractive in most cases, we have 
presented examples of systems with attractive suboptimal critical points, showing that 
traps do exist for such problems, and the fidelity at these traps can take many values, 
unlike the fidelity for kinematic critical points, which is limited to or 1 for pure-state 
optimization problems. 

For the problem of unitary operator optimization we demonstrated that there are 
indeed no traps when we consider the fidelity as a functional over the unitary group. 
Although there are critical manifolds on which the fidelity takes values between — 1 
(global minimum) and +1 (global maximum), all of these critical points are indeed 
saddle points, but the situation changes when the analysis is restricted to the special 
unitary group SU(A^). This case may appear artificial but it is actually highly relevant 
for quantum control, as many quantum control problems involve control Hamiltonians 
that have zero-trace, and hence we have no global phase control. Although the system 
may be U(iV) controllable when Tr[ffi] = 0, if the target time is fixed, so is the global 
phase. Perhaps surprisingly, there are more critical points in this case than for U(A''), 
and some of these are attractive critical points at which the fidelity assumes values 
< 1, i.e., traps. More careful analysis shows that these traps correspond to solutions 
U = e^^V, where e'^ is a root of unity, which immediately shows that these solutions 
have fidelity < 1, according to the standard definition of the fidelity. Nonetheless, these 
solutions are perfectly adequate for most practical purposes as we usually do not care 
about the global phase of an operator, and this problem can be avoided entirely simply 
by modifying the performance index to reflect the fact that we do not care about the 
global phase. Optimizating over the projective unitary group PU(iV) there are indeed 
no traps, at least on the group. Again, the situation is more complicated for actual 
optimization problem over controls in L^(0,T). Not only are there critical points / 
for which the fidelity assumes critical values other than those permitted over U(iV) 
or S\J{N), but examples again show that the Hessian at these critical points can be 
negative definite, implying that they arc locally attractive on any finitc-dimcnsional 
subspace and therefore traps. The results can even be extended to the case where the 
target time T is variable, again proving the existence of traps. 

The specific examples of traps constructed prove a theoretical point about 
the existence of traps, but they are not of particular interest in their own right. 
Unfortunately, due to the difficulty of exactly calculating the Hessian for complex 
systems with time-varying controls, it is difficult to prove the existence of traps for 
more interesting cases. Therefore we turned to numerical simulations to investigate 
any evidence for the existence of trapping regions in practice. For a three-qubit 
gate optimization problem we studied the simulation results suggests that there are 
genuine traps, i.e., attractive critical points for which the fidelity is below a certain 
threshold. Although such traps do not appear to be very common - in the example 
considered we observed trapping in about 42 out of 1000 cases, they do complicate the 
control landscape and obstruct convergence to a global extremum, and perhaps even 
more importantly, trapped runs can be extremely time-consuming, which suggests 
that it is important to consider the possibility of trapping in the algorithm design, 
to ensure that trapped runs are quickly identified and terminated. Furthermore, 
the example considered was a rather simple system of dimension N = 8 with six 
independently variable controls, ft is quite plausible that the number of traps and 
the likelihood of a local optimization algorithm converging to these increases with the 
system dimension or complexity. This is an important issue of practical concern that 
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requires a systematic further analysis. 
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Appendix A. Definition of Operators S and C 

We define the operators C and S via their action on functions a, /3 e i^(0, T) 

{l3\Ca)L2= J J cos{cj{a-T))[a{T)p{a) + l3{T)a{a)]dadT, (A.la) 

0<cr<T<T 

{l3\Sa)L^= j j sin(w(c7-r))[a(r)^((7)+^(T)Q(CT)]dCTdT (A.lb) 

0<(T<T<T 

For C, using the trigonometric identity cos(a — b) = cos(a) cos(6) + sin(a) sin(6), and 
symmetrization the double integral, we find 

C = H[sin(a;»)] + H[cos(w«)] (A.2) 

with Lu arbitrary. When ijj = S can be expressed in terms of its eigenfunction 
decomposition, constituted of the real Fourier basis 



A A 

+ E ^(4fc2 _ 1) (n[sin(2fca;.)] + n[cos(2fcu;.)]) (A.3) 



Recall that here H denotes the unnormalizcd projection onto its argument and • is a 
placeholder for the argument to an anonymous function. For example, H[sin(w»)] is 
the projector onto the function t ^ sm{ujt), i.e. the operator sending each element 
a e i2(0,T) to the element /o^sin(a;r)Q!(r) drsin(a;») in L'^{0,T). 
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